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METHOD AND SYSTEM FOR COMPOSING A QUERY FOR A DATABASE AND 

TRAVERSING THE DATABASE 
ABSTRACT 

5 A system and method of composing a query object for application against a database is 

provided. The method composes a selection clause for the query. Next, a criteria clause for the 
query is generated, with the criteria clause comprising input criteria related to the query and 
additional criteria specified against the query and generated criteria based on a joint relationship. 
Next a source clause utilizing elements in the database accessed by the query is generated. A 

1 0 database traversal system and method is provided. The method identifies all tables directly accessible 
by each table and creating a data structure comprising an entry for each table. The entry comprises 
an identification field for each table and a link field identifying the all tables directly accessible by 
each table. The data structure is traversed and an optimum path of the traversal paths utilizing data 
obtained from traversing the data structure is identified. 
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METHOD AND SYSTEM FOR COMPOSING A QUERY FOR A DATABASE AND 

TRAVERSING THE DATABASE 

FIELD OF THE INVENTION 

The field of the invention relates systems and methods to generate and traverse database 
query structures, in particular systems and methods of efficient organization and compiling SQL 
queries. 

5 

BACKGROUND OF THE INVENTION 

A database management system (DBMS) comprises a computer, data storage devices, disk 
drives and database management software. A relational database management system (RDBMS) is 
a DBMS which uses relational techniques for storing and retrieving information. The relational 

1 0 database management system comprises computerized information storage and retrieval systems in 
which data is stored on disk drives. The data is stored in the form of tables which comprise rows and 
columns. Each row, or tuple, has one or more columns. 

The RDBMS is designed to accept commands to store, retrieve, and delete data. A well- 
known set of commands is based on the Structured Query Language or SQL. The term query refers 

15 to a set of commands in SQL for retrieving data from the RDBMS. The constructs of SQL allow a 
RDBMS to provide a response to a particular query with a particular set of data given a specified 
database content. SQL however does not specify the actual method to find the requested information 
in the tables on the disk drives. The method in which the query is processed, i.e. query execution 
plan, affects the overall time for retrieving the data. Data retrieval time may be critical to the 

20 operation of the database minimizes the computer and disk access time, and therefore, optimizing 
the cost of doing the query. 

Accordingly, there is a need for a dynamic an efficient method and system for generating 
database queries. 



25 



CA9-2000-0048 



1 



02327167 2000-11-30 



SUMMARY OP THE INVENTION 

In a first aspect, the invention provides a method of composing a dynamic query for 
application against a database. First, the method composes a selection clause for the query, with the 
selection clause comprising a results set related to the query. Next, the method composes a criteria 
clause for the query, with the criteria clause comprising input criteria related to the query and 
additional criteria specified against the query. Next the method composes a source clause utilizing 
elements in the database accessed by the query. 

The method may compose an ordering scheme for results of the query. 

The method may compose a grouping scheme for results of the query. 

The method may compose the criteria clause by resolving joint relationships amongst the 
input criteria and the additional criteria. 

The method may compose the criteria clause by adding said joint relationships to the criteria 
clause. Further, the method may compose source clause by resolving a source after analyzing said 
selection clause and said criteria clause. The method may compose the query in SQL format. The 
method may apply the query against the database and results of the query may be provided to an 
output device. 

In a second aspect, a query transaction system is provided. The query transaction system 
comprises a computer, access to a database associated with the computer and a query processing 
program operating on the computer and generating a query for the database. The query processing 
program has a selection clause composing module for the query, the selection clause module 
producing a results set related to the query. The program also has a criteria clause composing 
module for the query, the criteria clause module processing input criteria related to the query and 
additional criteria specified against the query. The program also has a source clause composing 
module utilizing elements in the database identified by the query. 

The query processing program may further comprise an ordering module for results of the 

query. 

The query processing program may further comprise a grouping module for results of the 

query. 
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For the criteria clause composing module of the query processing program, the module may 
have a joint relationships resolving module associating the input criteria to the additional criteria. 
Further, the criteria clause composing module may comprise a module adding the joint relationships 
to the criteria clause. Also, the source clause composing module may resolve the source after 
analyzing the selection clause and the criteria clause. 

In another aspect, an article is provided. The article comprises a computer readable 
information storage medium and a computer readable program encoded on the medium. The program 
comprising a method of composing a query for application against a database. The method 
comprises composing a selection clause for the query, the selection clause comprising a results set 
related to the query, composing a criteria clause for the query, the criteria clause comprising input 
criteria related to the query and additional criteria specified against the query and composing a 
source clause utilizing elements in the database identified by the query. 

The method of the computer program may compose an ordering scheme for results of the 

query. 

The method of the computer program may compose a grouping scheme for results of the 

query. 

The method of the computer program may compose the criteria clause by resolving joint 
relationships amongst the input criteria and the additional criteria. The method may further compose 
the criteria clause by adding the joint relationships to the criteria clause. The method may also 
compose the source clause by resolving a source related to the database after analyzing the selection 
clause and the criteria clause. The method may also apply the query against the database and 
provided results of the query to an output device. 

In another aspect, an article is provided. The article comprises a computer readable 
modulated carrier signal and a computer readable program encoded on the carrier signal. The 
program comprises a method of composing a query for application against a database. The method 
comprises composing a selection clause for the query, the selection clause comprising a results set 
related to the query, composing a criteria clause for the query, the criteria clause comprising input 
criteria related to the query and additional criteria specified against the query and composing a 
source clause utilizing elements in the database identified by the query. 



CA9-2000-0048 



3 



02327167 2000- 11-30 

For the article, the program encoded on the signal may compose an ordering scheme for 
results of the query. 

For the article, the program encoded on the signal may compose a grouping scheme for 
results of the query. 

5 For the article, the program encoded on the signal may compose the criteria clause by 

resolving joint relationships amongst the input criteria and the additional criteria. 

For the article, the program encoded on the signal may compose the criteria clause by adding 
the joint relationships to the criteria clause. The program may compose the source clause by 
resolving a source related to the database after analyzing the selection clause and the criteria clause. 

10 In yet another aspect, a method for evaluating traversal paths amongst tables in a database 

is provided. The database has at least a first and a second table. The method comprises, first, for 
each table, identifying all tables directly accessible by each table and creating a data structure having 
an entry for each table. The entry comprises an identification field for each table and a link field 
identifying the all tables directly accessible by each table. Next, for each entry in the data structure, 

15 the method traverses the data structure to visit all other entries in the data structure, if possible, using 
contents of the link field of each entry. Next, the method identifies an optimum path of the traversal 
paths utilizing data obtained from traversing entries in the data structure. 

The method may track the number of hops taken to visit the all other entries for all possible 
traversal route to the all other entries. The method may have the data structure as a linked list. The 

20 method may traverse the data structure in a breadth first manner. Alternatively, the method may 
traverse the data structure in a depth first manner. The method may identify the optimum path 
utilizing the number of hops taken to visit the all other entries. The method may have the data 
structure further comprising a second link field identifying tables which directly access each table. 
The method may provide the optimum path to an output device. 

25 In yet another aspect, a database analysis system is provided. The system comprises a 

computer, access to a database associated with the computer, the database comprising at least a first 
table and a second table, and a database traversal program associated with the computer. The 
traversal program evaluates traversal paths between the first table and the second table. The traversal 
program has a method which, first, for each table of the plurality of tables, identifies all tables 
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directly accessible by each table and creates a data structure comprising an entry for each table. The 
entry comprises an identification field for each table and a link field identifying the all tables directly 
accessible by each table. Next, for each entry in the data structure, the method traverses the data 
structure to visit all other entries in the data structure, if possible, using contents of the link field of 

5 each entry. Next, the method identifies an optimum path of the traversal paths utilizing data obtained 
from traversing entries in the data structure. 

In yet another aspect, an article is provided comprising a computer readable instruction 
storage medium, a database traversal program encoded on the medium. The program evaluates 
traversal paths in a database. The database comprises at least a first table and a second table. The 

10 database traversal program has a method embodied therein. The method, first, for each table of the 
plurality of tables identifies all tables directly accessible by each table and creating a data structure 
comprising an entry for each table, the entry comprising an identification field for each table and a 
link field identifying the all tables directly accessible by each table. Next, for each entry in the data 
structure, the method traverses the data structure to visit all other entries in the data structure, if 

1 5 possible, using contents of the link field of each entry. Next the method identifies an optimum path 
of the traversal paths utilizing data obtained from traversing entries in the data structure. 

In other aspects of the invention, various combinations and subsets of the aspects described 
above are provided. 

20 BRIEF DESCRIPTION OP THE DRAWINGS 

The foregoing and other aspects of the invention will become more apparent from the 
following description of specific embodiments thereof and the accompanying drawings which 
illustrate, by way of example only, the principles of the invention. In the drawings, where like 
elements feature like reference numerals (and wherein individual elements bear unique alphabetical 
25 suffixes): 

Fig. 1 is the block diagram of a computer accessing a database system utilizing an 
embodiment of the invention; 

Fig. 2 is an exemplary screen shot of software operating on a computer of a search field 
accessing a database of Fig. 1; 
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Fig. 3 is a block diagram of a query structure used to access the database system of Fig. 1 ; 
Fig. 4 is an exemplary set of tables representing data stored in the database system of Fig. 1 ; 
Fig. 5 is a flow diagram of an algorithm of the embodiment of the database system of Fig. 

i; 

5 Fig. 6 is a block diagram of relationship aspects of table elements in the database system of 

Fig.1; 

Fig. 7 is a block diagram of a data structure generated by the embodiment of Fig. 1 ; 
Fig. 8 is a listing code implementing the algorithm of Fig. 5; 

Fig. 9A(i) is a listing of pseudocode associated with a portion of a query building module 
10 associated with the algorithm of Fig. 5; 

Fig. 9A(ii) is a continuation of a listing of pseudocode associated with a portion of the query 
building module of Fig. 9A(i); 

Fig. 9A(iii) is a continuation of a listing of pseudocode associated with a portion of the query 
building module of Fig. 9A(ii); 
1 5 Fig. 9A(iv) is a continuation of a listing of pseudocode associated with aportion of the query 

building module of Fig. 9A(iii); 

Fig. 9B is a listing of more pseudocode associated with a smart query associated with the 
algorithm of Fig. 5; 

Fig. 10 is a block diagram of an exemplary association of tables in a database for an 
20 embodiment of Fig. 1; 

Fig. 1 1 is a block diagram of a data structure representing the table associations of Fig. 10; 

and 

Fig. 1 2 is a block diagram of a distributed computer network utilizing aspects of the embodiment of 
Fig. 1. 

25 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The description which follows, and the embodiments described therein, are provided by way 
of illustrating an example, or examples, of particular embodiments of principles of the present 
invention. These examples are provided for the purpose of explanation, and not limitation, of those 
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principles and of the invention. In the description which follows like elements are marked 
throughout the specification and the drawings with the same respective reference numerals. 

Referring to Fig. 1, computer 100 has software 102 operating thereon allowing queries to be 
made to database 104 which is accessible by computer 100. Database 104 is accessible either 
5 internally or externally via computer 100. Database 104 may be a relational database. Queries to 
database 104 may be in Structured Query Language (SQL). Display 106 provides a visual interface 
for the user of computer 100 when accessing software 102. Software 102 causes user prompts and 
search results of queries to database 104 to be shown on display 106. Data and queries may be 
entered to software 102 via keyboard 107 on computer 100. 
10 Software 102 maybe encoded on disk 108. Disk 108 may be inserted into computer 100 via 

disk drive 1 10 to allow computer 100 to load software 102 into its memory. Alternatively, software 
1 02 may be embodied onto CD-ROM 1 1 2 in an appropriate computer readable code, which may load 
its contents into computer 1 10 via CD-ROM drive 1 14. It will be appreciated that other medium and 
mechanisms may be used to load software 102 on to computer 100 including remote downloads 
15 wherein the software 102 is transmitted to computer 100 from a remote computer utilizing a 
modulated carrier signal. 

Referring to Fig. 2, screen shot 200 shows a typical query screen generated by software 1 02 
and shown on display 106 for accessing database 104. The query screen 200 has fields into which 
a user enters values to compose a query which will be executed against database 104. The fields 
20 include description field 202, manufacturer field 204 and a drop-down menu for price field 206. 
Using computer 1 00, the user enters values for the fields of, e.g. "stove" in field 202, "Sears" in field 
204, then selects a price from field 206, e.g. "$500". The user then activates the "Search Now" 
button 208 which causes software 102 to generate an appropriate SQL query for "stoves" from 
"Sears" which cost "$500" and apply it against database 104. The results of the query are then 
25 provided to the user or the system. The results may be provided on display 106, to a printer (not 
shown), to a disk in a written format, to another database, to another computer or to any output 
device known in the art. 

Referring to Fig. 3, an SQL query generated by software 102 from the query entered by the 
user is shown in query syntax 300. Query syntax 300 comprises "select" clause 302, "from" clause 
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304 and "where" clause 306. "Select" clause 302 identifies the table columns of database 104 from 
which the response to the query is generated. "From" clause 304 identifies the tables from which the 
query will generate its response. "Where" clause 306 identifies specific items 312 from the tables 
for which the user provided specific search criteria. Relating the contents of clauses 302, 304 and 

5 306 to query screen 200, parameters in field 308 identify the description field 202 and manufacturer 
field 204. Parameter 310 identifies the tables in which description field and manufacturer field 
information are stored in database 1 04; parameters in field 312 identify queries relating to description 
field 202, manufacturers field 204 and price field 206. 

Referring to Fig. 4, a representative series of tables 400 relating to database 104 are shown. 

10 In particular, table 402 contains a row listing items contained in the database 102. Table 404 
contains a row listing manufacturers to products and table 406 is a table of products to 
manufacturers. 

Referring to Fig. 5, algorithm 500 provides a flow chart for software 102 of the main 
functional aspects of the embodiment which processes database requests, such as a request generated 

15 from screen shot 200 (Fig. 2), to generate SQL queries, such as SQL query 300 (Fig. 3). First, 
software 102 is initialized at step 502. Initialization may include aspects such as turning on the 
computer 100, initializing the access to appropriate databases and loading appropriate software to 
and from appropriate computers. Next, step 504 initializes query. In this step, a new query is 
defined, appropriate resources are allocated to it and a "select" clause is built. 

20 In step 506 the "where" clause 306 (Fig. 3) is built, by creating a predicate based on the user 

specified search criteria. Next, additional predicates are added to the predicate. For example, if there 
are any common search criteria, they may be provided as an additional predicate and appended to 
the query as hard-coded operands. Utilizing additional set predicates enables a query to be built 
without having to identify repeatedly common query elements for each query. Accordingly, such 

25 hard-coded operands enable the query to be executed faster without parsing additional elements of 
the query. 

Also in step 506, joint predicates are resolved from the user inputs and any common 
predicates. In the embodiment all table relationships, either direct or indirect, are stored in a 
predetermined file, built from a predetermined XML file. The file contains a relationship dictionary 
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which is searched by software 102 to ascertain relationships existing amongst tables. The file is 
parsed and a dictionary of table links is generated. For example, for Tables A, B and C, Table A and 
Table B may be linked through Table C via the relationship TableA.coll=TableC.col2 and 
TableC.col3=TableB.col4. The dictionary entry will have a key of "TableA & TableB" and its 
5 associated element wouldbe "TableA.coll^TableC.coE andTableC.col3=TableB.col4." Itcanbe 
appreciated that such table relationships may be provided through a separate database catalogue. 
Next, joint predicates are added to the predicates to create the "where" clause 306. 

Next, "from" clause 304 is created in step 508. Therein, the source tables are resolved in the 
"from" clause 304 using explicit instructions from the user and implicit information from the source 
10 tables in the "select" clause 302 and "where" clause 306. 

Grouping and ordering of the clauses are performed in steps 510 and 512 and the query 
statement is executed in step 514. 

Referring to Fig. 6, Rose diagram 600 illustrates aspects of SQL statements modelled by 
objects used by the embodiment. Rose diagram 600 comprises a series of objects showing 
15 interrelationships amongst objects by arrows. Each arrow relates child object (source) to a parent 
object (destination). A number associated with the head of the arrow indicates the number of parents 
associated with each child. Using an object oriented design for modelling a SQL statement, 
components for the SQL statement can be dynamically created and manipulated as objects. 

In particular, query object 602 is the central query object interface for the embodiment. It 
20 contains one or more Attribute Info Objects 604. Result object 606 contains data retrieved by 
executing the query. Predicate object 608 may be related in a zero-to-one relationship to query 
object 602. Predicate object 608 models the complex conditions for the related SQL statement. 
Operator object 610 has a one-to-one association with predicate object 608. Operator object 610 
assigns an attribute value to an attribute 612. Attribute object 612 models a searchable attribute. It 
25 is created from Attributelnfo object 604. Attributelnfo 604 is an object containing the metadata of 
each column in the database table. 

For each SQL statement, attribute obj ect 6 1 2 contains an operator object 6 1 0 and an attribute 
value object 6 1 4. Table object 6 1 6 is associated with query object 602, in a one-to-many relationship. 
Smart query object 618 is associated with query 602 as a child. Catalogue query object 620 is 
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associated with smart query object 61 8 as a child. Both smart query object 61 8 and catalogue query 
620 are appended to query 602 using elements of the embodiment in order to streamline operation 
and execution of query 602. Further detail on the operation of Smart Query object is provided later. 

Referring to Fig. 7, relationships associated with predicate object 608 may be used to 
dynamically compose a query tree 700 for the following SQL query in Example 1 : 

Example 1 

Select Tl.referenceNumber, T2.colour 
from CatalogueEntry Tl, AttributeValue T2 

where (TLrefld = "123" and T2.colour = "red") and (Tl.Name = "Sears") and 
(Tl.refld = T2.refld) 



There are two predicates with the query, namely predicate 702 and predicate 704 which both 
comprise an AND operator. Predicate 704 operates on attribute 706 and attribute 708. Attribute 708 

15 associates the catalogue reference ID field of Table 1 ("Tl") with the value of "123". Attribute 708 
associates the Colour Attribute Info of Table 2 ("T2 n ) with colour "red". These tables and values are 
represented by elements 710, 712, 714, and 716, respectively. Attribute 718 equates the MName 
field of Tl, represented by attribute 720, with a value of "Sears", represented by the value 722. 
Attribute 724 equates the Tl.refid field 710 with T2.refld field 726. Predicate 702 operates on 

20 predicate 704, attribute 7 1 8 and attribute 724. 

Referring to Fig. 8, the embodiment constructs code 800 which embodies the SQL query of 
Example 1. First, code 802 constructs simple attribute conditions. Next, composite search 
conditions, using predicates, are constructed through code at 804. Finally, the query is executed 
through code at 806 and results are returned through code at 808. 

25 Details of the pseudo underlying the creation of appropriate data structures for code 800 are 

now provided. 
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Referring to Fig. 9A(i) and Fig. 5, aspects of pseudocode for algorithm 500 used to build a 
results set are shown. First, per step 504, pseudocode in section 900 defines aresults set information 
object 901 for the query. Code in 902 builds a "select" clause for the query by consecutively adding 
Attributelnfo objects 904 into the results set information object 901. 

Referring to Fig. 9A(ii), 9A(iii) and Fig. 5, the first part of a "where" clause is built per step 
506. A series of two parts for a predicate set are built in sections 906a, and 906b. For each 
predicate, an operator is defined at 908a and 908b, then a series of operands are added per sections 
910a and 910b. 

Referring to Fig. 9A(iv) and Fig. 5, the remaining part of the "where" clause is built. Code 
912 creates an object for the joint predicates associated with the query. Code at 914 adds the joint 
relationships to the existing predicates. Finally, SetPredicate code 916 adds the smart query and the 
catalog query predicates to the existing predicates. 

Next, for step 508, code 9 1 8 resolves the source tables for the query. Ordering and grouping 
of predicates by clause (steps 510 and 512) are performed by code 920. Finally, the query is 
executed for step 514 using code 922. 

Referring to Fig. 9B, further detail is provided for the smart query predicate. First, a test is 
conducted to determine whether any hard coded predicates are to be added, per line 924. If hard 
coded predicates exist, they are added to the existing predicate per the code at 926. 

If hard coded predicates do not exist, then joint table predicates are resolved through code 
at 928, embodied specifically in code 930. The joint predicates provide information on how tables 
are related to each other. These relationships are required to conduct a search based on multiple 
tables on a relational database since some information can expand several tables. 

Another aspect of the embodiment provides a system and method of evaluating the number 
of hops between tables when determining links amongst elements in tables when queries are 
executed. 

As noted earlier, when executing a query, multiple tables are often associated with it. 
Accordingly table joint conditions must be specified amongst the tables. There are two types of table 
joints: (i) a direct foreign key relationship, where a column in table A is a foreign key to table B; and 
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(ii) an indirect foreign key relationship, where the foreign key relationships are described in separate 

tables and the relationships may involve several indirect tables. 

Referring to Fig. 10, table relationship 1000 is an example of relationships amongst Table 

A at 1002, Table B at 1004, Table C at 1006, Table D at 1008 and Table E at 1010. Tables in Fig. 
5 1 0 are related by arrows, such as arrow 1012. The tail of the arrow indicates the source table in the 

relationship. The head of the arrow points to the table associated with the source. For Table A, each 

of Table B, Table C, and Table D is associated with it, i.e. Table A can recognize a link to each of 

those tables. Table C is associated with Table B. Tables D and E are associated with Table C. 

Links amongst tables can be direct or indirect. Table A recognizes a direct link to Table C. 
10 Table C recognizes a direct link to Table E. However, Table A can recognize a link to Table E via 

the link provided by Table C. In database operations, links amongst tables are frequently calculated. 

In order to minimize traversal times amongst the tables, any traversal amongst tables should select 

the shortest path. 

In order to determine the shortest path, attributes of tables are traversed to determine all tables 
15 involved in the query. A table graph is then created at initialization. A query framework then 
traverses the table graph to determine the joint predicates for these tables. Then, a composite 
predicate is formed with the user attribute predicates and the table joint predicates. 

To determine a relationship between two tables, the tables in a database are traversed to 
generate a list of all direct foreign references. For each table, an inlist and an outlist is produced. 
20 This information is provided to a mapping comprising many-linked lists. 

After the mapping is generated, to determine a relationship between two tables, the outgoing 
list from the first table is examined. From each element in the outgoing list, the mapping is traversed 
through its outlist until the destination table or a dead-end is reached. For each pass leading to the 
destination table, a variable containing the distance of hops required to get to the destination table 
25 is stored! Accordingly, the shortest path between the originating and destination tables may be 
selected from the path having the smallest number stored in its variable. The shortest path may be 
the optimum path. 

Referring to Fig. 1 1, data structure 1 100 representing elements of the table relationship 
shown in Fig. 10 is shown. Data structure 1 102 represents an entry for Table A; similarly, data 
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structure 1 1 04 represents an entry for Table B ; data structure 1 1 06 represents Table C; data structure 
1 108 represents Table D; and data structure 1110 represents Table E. Data structure 1 102 has an 
infield 1112 identifying all table elements which call on table A. Infield 1 1 12 is empty as there is 
no table which calls on Table A. Outfield 1 1 14 identifies all tables which Table A may access. 
5 These include Table B, Table C and Table D, as indicated by the direction of the arrows on Fig. 10. 
Similarly, data structure 1 1 02 has infield 1116 containing a reference to Table A and outfield 1118 
containing Table C. Similarly, Table C has infield 1 120 containing references to Table A and Table 
B. Outfield 1 122 of data structure 1 106 contains references to Table D and Table E. Infield of data 
structure 1 108 contains a reference to Table A and Table C. Outfield 1 126 of data structure 1 108 
10 is empty. Infield 1 128 of data structure 1110 contains a reference to Table C. Outfield 1 130 is 
empty. 

Accordingly, a linked data structure, such as a linked list, may be generated wherein starting 
from one data structure and traversing through all outfield data elements, a network of linkages 
amongst the table elements may be generated. For example, beginning with data element 1 102, a 

15 link from Table A is made to Table B. Then traversing from Table B in data structure 1 104, a link 
is made to Table C through outfield 1118. Next, a link to data structure 1 106 provides a link to 
Table D through outfield 1 1 22. Finally, Table D data structure 1108 ends with an outfield at outfield 
1 1 26. Accordingly, traversal reverts back up to Table A to determine if any other linkages can be 
made. Accordingly, a link to Table C from outfield 1 1 14 is made. This leads to an access to Table 

20 D through outfield 1 122 of Table C. Following the link through Table D leads to a null field at 
outfield 1 1 26. Reverting back to Table A data structure 1 1 02, Table D entry in outfield 1114 leads 
directly to the null field 1 126 of data structure 1 108. 

The next unresolved outfield is examined. As Table B has all of its outfields resolved, for 
Table C data structure 1 106 is examined for contents of its outfield 1 122, namely table E. At Table 

25 E, data structure 1110 shows that its outfield is null in field 1130. Accordingly, the entire tree has 
been traversed with all elements in this matter. Next, each traversal route can be summed for its 
routing costs. For a system where each traversal is an equivalent cost, it can be shown that by 
traversing the data structures to go from Table A to Table B may be done in one step. Similarly, the 
cost to go from Table A to Table C is either one or two hops. The cost to go from Table A to table 
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D is one, two or three hops. The cost to go from Table A to Table E is two hops. By tracking all 
costing routes, the most efficient route may be selected. It can be appreciated that other algorithms 
may be used to traverse the tree and other costing mechanisms may be used to weight each traversal 
path amongst table elements which may be implemented in other embodiments to determine an 



Referring to Fig. 12, computer network 1200 is shown. Network 1200 comprises network 
system 1202, such as the Internet, which connects computer 100 to server 1204. It can be 
appreciated that software 1 02 may provided to computer 1 00 via server 1 204. Databases 1 04A and 
104B are distributed along network 1202. Server 1204, computer 100 access databases 104A and 
10 1 04B through network 1202. 

As far as the user on computer 100 is concerned, he does not have knowledge of the 
distributed nature of the information coming to computer 100 over network 1202. In the preferred 
embodiment, software 1 02 in server 1 00 utilizes electronic java beans (EJB) to provide access to the 
system. 

15 It is noted that those skilled in the art will appreciate that various modifications of detail may 

be made to the preferred embodiments described herein which would come within the scope of the 
invention as described in the following claims. 
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CLAIMS 

The embodiments of the invention in which an exclusive property or privilege is claimed are defined 
as follows: 

5 

1 . A method of composing a query for application against a database, said method comprising: 
composing a selection clause for said query, said selection clause comprising a results set related to 
said query; 

composing a criteria clause for said query, said criteria clause comprising input criteria 
10 related to said query and additional criteria specified against said query; and 

composing a source clause utilizing elements in said database identified by said query. 

2. A method of composing a query for application against a database as claimed in claim 1, 
wherein said method further comprises composing an ordering scheme for results of said query. 

15 

3. A method of composing a query for application against a database as claimed in claim 1, 
wherein said method further comprises composing a grouping scheme for results of said query. 

4. A method of composing a query for application against a database as claimed in claim 1, 
20 wherein said composing said criteria clause further comprises resolving joint relationships amongst 

said input criteria and said additional criteria. 

5. A method of composing a query for application against a database as claimed in claim 4, 
wherein said composing said criteria clause further comprises adding said joint relationships to said 

25 criteria clause. 

6. A method of composing a query for application against a database as claimed in claim 5, 
wherein said composing said source clause further comprises resolving a source after analyzing said 
selection clause and said criteria clause. 
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7. A method of composing a query for application against a database as claimed in claim 6, 
wherein said query is produced in SQL format. 

8. A method of composing a query for application against a database as claimed in claim 7, 
wherein said method applies said query against said database and results of said query are provided 
to an output device. 

9. A query transaction system comprising: 
a computer; 

access to a database associated with said computer; and 

a query processing program operating on said computer and generating a query for said 
database, said query processing engine comprising: 

a selection clause composing module for said query, said selection clause module 
producing a results set related to said query; 
a criteria clause composing module for said query, said criteria clause module processing 
input criteria related to said query and additional criteria specified against said query; and 

a source clause composing module utilizing elements in said database identified by said 

query. 

10. A query transaction system as claimed in claim 9, wherein said query processing program 
further comprises an ordering module for results of said query. 

11. A query transaction system as claimed in claim 9, wherein said query processing program 
further comprises a grouping module for results of said query. 

12. A query transaction system as claimed in claim 9, wherein said criteria clause composing 
module further comprises a joint relationships resolving module associating said input criteria to said 
additional criteria. 
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13. A query transaction system as claimed in claim 1 2, wherein said criteria clause composing 
module further comprises a module adding said joint relationships to said criteria clause. 

14. A query transaction system as claim in claim 13, wherein said source clause composing 
5 module resolves said source after analyzing said selection clause and said criteria clause. 



15. An article comprising: 

a computer readable information storage medium; 

a computer readable program encoded on said medium, said program comprising a method 
10 of composing a query for application against a database, said method comprising 

composing a selection clause for said query, said selection clause comprising a results 
set related to said query; 

composing a criteria clause for said query, said criteria clause comprising input 
criteria related to said query and additional criteria specified against said query; and 
15 composing a source clause utilizing elements in said database identified by said 

query. 

16. An article as claimed in claim 15, wherein said method of said computer program further 
comprises composing an ordering scheme for results of said query. 

20 

17. An article as claimed in claim 15, wherein said method of said computer program further 
comprises composing a grouping scheme for results of said query. 



18. An article as claimed in claim 15, wherein said method of said computer program composes 
25 said criteria clause by resolving joint relationships amongst said input criteria and said additional 
criteria. 



19. An article as claimed in claim 1 8, wherein said method of said computer program composes 
said criteria clause by adding said joint relationships to said criteria clause. 
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20. A method of composing a query for application against a database as claimed in claim 1 9, 
wherein said composing said source clause further comprises resolving a source related to said 
database after analyzing said selection clause and said criteria clause. 

21. A method of composing a query for application against a database as claimed in claim 20, 
wherein said query is applied against said database and results of said query are provided to an output 
device. 

22. An article comprising: 

a computer readable modulated carrier signal; 

a computer readable program encoded on said carrier signal, said program comprising a 
method of composing a query for application against a database, said method comprising: 

composing a selection clause for said query, said selection clause comprising a results 
set related to said query; 

composing a criteria clause for said query, said criteria clause comprising input 
criteria related to said query and additional criteria specified against said query; and 
composing a source clause utilizing elements in said database identified by said 
query. 

23. An article as claimed in claim 22, wherein said method of said computer program further 
comprises composing an ordering scheme for results of said query. 

24. An article as claimed in claim 22, wherein said method of said computer program further 
comprises composing a grouping scheme for results of said query. 

25. An article as claimed in claim 22, wherein said method of said computer program further 
comprises composing said criteria clause by resolving joint relationships amongst said input criteria 
and said additional criteria. 
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26. An article as claimed in claim 22, wherein said method of said computer program further 
comprises composing said criteria clause by adding said joint relationships to said criteria clause. 

21. An article as claimed in claim 26, wherein method of said computer program composes said 
5 source clause by resolving a source related to said database after analyzing said selection clause and 
said criteria clause. 

28. A method for evaluating traversal paths amongst a plurality of tables in a database, said 
database comprising at least a first table and a second table, said method comprising: 

10 for each table of said plurality of tables 

identifying all tables directly accessible by said each table; and 
creating a data structure comprising an entry for said each table, said entry 
comprising an identification field for said each table and a link field identifying said 
all tables directly accessible by said each table; 

15 

for each entry in said data structure, traversing said data structure to visit all other entries in 
said data structure, if possible, using contents of said link field of said each entry; and 

identifying an optimum path of said traversal paths utilizing data obtained from traversing 
20 entries in said data structure. 

29. A method for evaluating traversal paths between a first table and a second table, as claimed 
in claim 28, said method further comprising tracking the number of hops taken to visit said all other 
entries for all possible traversal route to said all other entries. 

25 

30. A method for evaluating traversal paths between a first table and a second table, as claimed 
in claim 29 wherein said data structure comprises a linked list. 
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31. A method for evaluating traversal paths between a first table and a second table, as claimed 
in claim 30 wherein said data structure is traversed in a breadth first manner. 

32. A method for evaluating traversal paths between a first table and a second table, as claimed 
5 in claim 30 wherein said data structure is traversed in a depth first manner. 

33 . A method for evaluating traversal paths between a first table and a second table, as claimed 
in claim 30 wherein said optimum path is identified utilizing said number of hops taken to visit said 
all other entries. 

10 

34. A method for evaluating traversal paths between a first table and a second table, as claimed 
in claim 33 wherein for said each table of said plurality of tables, said data structure further 
comprises a second link field identifying tables which directly access said each table. 

15 35. A method for evaluating traversal paths between a first table and a second table, as claimed 
in claim 34, wherein a value of said optimum path is provided to an output device. 

36. A database analysis system comprising: 
a computer; 

20 access to a database associated with said computer, said database comprising at least 

a first table and a second table; and 

a database traversal program associated with said computer for evaluating traversal 
paths between said first table and said second table, said database traversal program 
comprising a method comprising: 
25 for each table of said plurality of tables 

identifying all tables directly accessible by said each table; and 
creating a data structure comprising an entry for said each table, said 
entry comprising an identification field for said each table and a link 
field identifying said all tables directly accessible by said each table; 

20 
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for each entry in said data structure, traversing said data structure to visit all 
other entries in said data structure, if possible, using contents of said link field 
of said each entry; and 

5 

identifying an optimum path of said traversal paths utilizing data obtained 
from traversing entries in said data structure. 

37. A database analysis system, as claimed in claim 36, wherein said method further comprises 
10 tracking the number of hops taken to visit said all other entries for all possible traversal route to said 

all other entries. 

38. A database analysis system, as claimed in claim 36 wherein said data structure comprises a 
linked list. 

15 

39. A database analysis system, as claimed in claim 36 wherein said data structure is traversed 
in a breadth first manner. 

40. A database analysis system, as claimed in claim 36 wherein said data structure is traversed 
20 in a depth first manner. 

41. A database analysis system, as claimed in claim 3 6 wherein said optimum path is identified 
utilizing said number of hops taken to visit said all other entries. 

25 42. A database analysis system, as claimed in claim 36 wherein for said each table of said 
plurality of tables, said data structure further comprising a second link field identifying tables which 
directly access said each table. 

43. An article comprising: 
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a computer readable instruction storage medium; 

a database traversal program encoded on said medium for evaluating traversal paths 
in a database, said database comprising at least a first table and a second table, said 
database traversal program comprising a method, said method comprising: 
5 for each table of said plurality of tables 

identifying all tables directly accessible by said each table; and 
creating a data structure comprising an entry for said each table, said 
entry comprising an identification field for said each table and a link 
field identifying said all tables directly accessible by said each table; 

10 

for each entry in said data structure, traversing said data structure to visit all 
other entries in said data structure, if possible, using contents of said link field 
of said each entry; and 

15 identifying an optimum path of said traversal paths utilizing data obtained 

from traversing entries in said data structure. 

44. An article as claimed in claim 43, wherein said method further comprising tracking the 
number of hops taken to visit said all other entries for all possible traversal route to said all other 

20 entries. 

45. An article as claimed in claim 43 wherein said data structure comprises a linked list. 

46. An article as claimed in claim 45 wherein said data structure is traversed in a breadth first 
25 manner. 

47. An article as claimed in claim 45 wherein said data structure is traversed in a depth first 
manner. 
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48. An article as claimed in claim 45 wherein said optimum path is identified utilizing said 
number of hops taken to visit said all other entries. 

49. An article as claimed in claim 45 wherein for said each table of said plurality of tables, said 
data structure further comprises a second link field identifying tables which directly access said each 
table. 

50. A computer program comprising computer program code means adapted to perform all the 
steps of claim 1 when said program is run on a computer system. 

51. The article of claim 22 wherein said signal is a transmission over a network such as the 
Internet. 
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Figure 1 
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Search Criteria 

Fill in at least one field. Fill in more to narrow your search. 
Need high speed? Try Fast Search. 
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Figure 8 



// construct the simple search conditions 

"Attribute attrl = new Attribute (CatRefldAttributelnfo, Operator.equal, "123"); 
Attribute attr2 = new Attribute (Colourttributelnfo, Operator.equal, "red"); 
802 Attribute attr3 = new Attribute (ManufatureAttributelnfo, Operator.equal, "Sears"); 
Attribute attr4 = new 

Attribute(CatRefldAttributeInfo,Operator.equal,DescRefIdAttributeInfo,); 

// compose composite search conditions 

[Predicate pi = new Predicate (Operator.and, {attrl, attr2} ); 
Predicate p2 = new Predicate (Operator.and, {pi, attr3, attr4} ); 

// execute the query 
806 Query q = new Query () 

q.setResultSet ({ CatRefldAttributelnfo, ...})// result set contains catalog entryld 

q.setPredicate (p2); 
808 result = q.execute () 



800 



^QRV 02327167 2000-11-30 



Figure 9A(i) 



public void MCQueryO throws Exception { 
Debug.setLocalTest(true); 

System.out.println("****************** Merchant Centre ********************"); 
900 — CatalogQuery MCQuery = new CatalogQueryQ; — 901 



Jl Result set j— ■ 

^[CQuery.addResultSetInfo(new Result! CatEntryldentifierAttributeInfo.getSingleton()())); 
MCQuery.addResultSetInfo(new Result! StorelnvQuantityAttributelnfo.getSingletonOO)); 
902 — MCQuery.addResultSetInfo(new Result! CatEntDescShortDescAttributelnfo.getSingleton())); 
MCQuery.addResultSetInfo(new Result! CatEntDescNameAttributelnfo.getSingletonO)); 
MCQuery.addResultSetInfo(new Result! CatEntryTypeAttributeInfo.getSingleton())); 
^ICQuery.setDistinctQualifier(true); 



/ 
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Figure 9A(ii) 



// Predicate set 
//Parti 

Predicate pi 1 = new Predicate (); 908a 

pi 1 .setOperator (Operator, or); 

Attribute al 1 1 = new Attribute (CatGrpDescNameAttributelnfo.getSingleton(), Operator.leftlike, 

"CATEGORY X"); 

al 1 1 .setUppercaseQualifier(true); 

p 1 1 .addOperand (al 1 1 ); 

Attribute al 12 = new Attribute (CatGrpDescNameAttributelnfo.getSingletonO, Operator.leftlike, 

"CATEGORY 10"); 

al 12.setUppercaseQualifier(true); 

pi 1. addOperand (al 12); 

Predicate pl2 = new Predicate (); 908a 

pi 2.setOperator (Operator.and); 

"pl2.addOperand (new Attribute (ListPriceAttributelnfo.getSingleton(), Operator.gt, "0.0")); 
pl2.addOperand (new Attribute (StoreInvQuantityAttributeInfo.getSingleton(), Operator.gt, "0.0")); 
pl2.addOperand (new Attribute (lnventoryQuantityMeasureAttributelnfo.getSingletonO, Operator, isnull)); 
pl2.addOperand (pi 1); 

Predicate pl3 = new Predicate (); 908a 

pi 3. setOperator (Operator.and); 

pi 3. addOperand (new Attribute (ListPriceAttributeInfo.getSingleton(), Operator.gt, "0.0")); 

pl3.addOperand (new Attribute (StoreInvQuantityAttributeInfo.getSingleton(), Operator.gt, "0.0")); 

pB.addOperand (new Attribute (InventoryQuantityMeasureAttributelnfo.get Singleton(),Operator.isnull)); 

Attribute al3 = new Attribute (CatGrpDescNameAttributeInfo.getSingleton(), Operator.leftlike, 

"CATEGORY5"); 

al3.setUppercaseQualifier(true); 

pi 3. addOperand (a 13); 



Predicate pl4 = new Predicate (); 908a 

pl4.setOperator (Operator.or); 
910a [pl4.addOperand(pl2); 
p!4.addOperand (pi 3); 
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Figure 9A(iii) 



// Part II 

Predicate p21 = new Predicate (); 908b 

p2 1 .setOperator (Operator.or); 

'Attribute a21 1 = new Attribute (CatGrpDescNameAttributelnfo.getSingleton (), Operator, leftlike, 

"CATEGORY Z"); 

a21 1 .setUppercaseQualifier(true); 

p21.addOperand (a21 1); 

Attribute a212 = new Attribute (CatGrpDescNameAttributelnfo.getSingleton (),Operator.leftlike, 
"CATEGORY9"); 

a2 1 2.setUppercaseQualifier(true); 
p2 1 .addOperand (a2 12); 

Predicate p22 = new Predicate (); 908b 

p22.setOperator (Operator.and); 

p22.addOperand (new Attribute (ListPriceAttributeInfo.getSingleton(), Operator.gt, "0.0")); 
p22.addOperand (new Attribute (StoreInvQuantityAttributeInfo.getSingleton(), Operator.gt, "0.0")); 
p22.addOperand (new Attribute (InventoryQuantityMeasureAttributelnfo.get SingletonO.Operator.isnull)); 
p22. addOperand (p21); 

Predicate p23 = new Predicate (); 908b 

p23. setOperator (Operator.and); 

p23. addOperand (new Attribute (ListPriceAttributeInfo.getSingleton(), Operator.gt, "0.0")); 

p23. addOperand (new Attribute (StoreInvQuantityAttributelnfo.getSingleton(), Operator.gt, "0.0")) ; 

p23.addOperand (new Attribute (InventoryQuantityMeasureAttributelnfo.get Singleton(),Operator.isnull)); 

Attribute a23 = new Attribute (CatGrpDescNameAttributeInfo.getSingleton(), Operator.leftlike, 

"CATEGORY4"); 

a23 .setUppercaseQualifier(true); 

p23. addOperand (a23); 

Predicate p24 = new Predicate (); 908b 

p24. setOperator (Operator.or); 
p24.addOperand (p22); 
p24.addOperand (p23); 
p24.setNotQualifier(true); 
System.out.println(p24.toStringQ); 
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Figure 9A(iv) 
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//Part IV -Join 

Predicate p4 = new Predicate (); 912 

p4.setOperator (Operator.and); 
p4.addOperand (pl4); 
p4.addOperand (p24); 

p4.addOperand (new Attribute (StoreCEntStoreIdentifierAttributeInfo.getSingleton(), 
Operator.eq, "2")); 

p4.addOperand (new Attribute (UsersIdentifierAttributeInfo.getSingleton(), Operator.eq, 
"1001")); 

//p4.addOperand (p33); 
MCQuery.setPredicate(p4); — 916 
// Join 

System.out.println("Auto Join : "); 
MCQuery.printJointRelationships(); 

// Resolve source tables 
MCQuery.resolveSourceTablesO; — 918 

// ORDER, GROUP and HAVING set 

MCQuery.addResultOrder(CatEntryIdentifierAttributeInfo.getSingleton(), 
Operator.desc); — — 920 

System.out.println("MC Query : "); 
System.out.println(MCQuery.toStringO); 

com.ibm.commerce.base.objects.Cursor cursor = new 
com.ibm.commerce.base.objects.CursorO; 
java.util.Vectorv = MCQuery.execute(cursor); — 922 
System.out.println("MC Query first 10 Result: ") 
System.out.println(v); 
cursor. increment(); 

v = MCQuery.execute(cursor); — — 922 
System.out.println("MC Query next 10 Result: "); 
System, outprintln(v) ; 



} 
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Figure 9B 
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public void setPredicate(Predicate aPredicate) throws Exception { 

Predicate additionalP = additionalPredicate(); — 924 
if£additionalP != null) { 

Predicate p = new Predicate(); 

p.setOperator(Operator.and); 

p.addOperand(aPredicate); 

p.addOperand(additionalP); 

setTableJointPredicate(p); 

>" 
else 

setTableJointPredicate(aPredicate); 

} 



private void setTablejointPredicate(Predicate aPredicate) throws Exception { 

Predicate jointP = resolveJointPredicate(aPredicate); 
if(jointP !=null) { 

Predicate p = new Predicate(); 

p.setOperator(Operator.and); 

p.addOperand(aPredicate); 

p.addOperand(jointP); 

super.setPredicate(p); ^\ 
} 930 
else 

super.setPredicate(aPredicate); 

} 
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